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Cosmic web-type classification using decision theory 
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We propose a decision criterion for segmenting the cosmic web into different structure types (voids, sheets, 
filaments, and clusters) on the basis of their respective probabilities and the strength of data constraints. Our 
approach is inspired by an analysis of games of chance where the gambler only plays if a positive expected 
net gain can be achieved based on some degree of privileged information. The result is a general solution 
for classification problems in the face of uncertainty, including the option of not committing to a class for a 
candidate object. As an illustration, we produce high-resolution maps of web-type constituents in the nearby 
Universe as probed by the Sloan Digital Sky Survey main galaxy sample. Other possible applications include 
the selection and labelling of objects in catalogues derived from astronomical survey data. 


I. INTRODUCTION 

Building accurate maps of the cosmic web from galaxy 
surveys is one of the most challenging tasks in modern 
cosmology. Rapid progress in this field took place in the 
last few years with the introduction of inference tech¬ 
niques based on Bayesian probability theory (Kitaura 
et al. , 2009; Jasche et al. , 2010; Nuza et al ., 2014; 
Jasche, Leclercq & Wandelt, 2015). This facilitates the 
connection between the properties of the cosmic web, 
thoroughly analyzed in simulations (e.g. Hahn et al ., 
2007; Aragon-Calvo, van de Weygaert & Jones, 2010; 
Cautun et al ., 2014), and observations (see Leclercq, 
Pisani & Wandelt, 2014, for a review on the interface 
between theory and data in cosmology). 

In Leclercq, Jasche & Wandelt (2015), we conducted 
a fully probabilistic analysis of structure types in the 
cosmic web as probed by the Sloan Digital Sky Survey 
(SDSS) main galaxy sample. This study capitalized on 
the large-scale structure inference performed by Jasche, 
Leclercq & Wandelt (2015) using the BORG (Bayesian 
Origin Reconstruction from Galaxies, Jasche & Wan¬ 
delt, 2013) algorithm. As the full gravitational model 
of structure formation COLA (COmoving Lagrangian Ac¬ 
celeration, Tassev, Zaldarriaga & Eisenstein, 2013) was 
used, our approach resulted in the first probabilistic and 
time-dependent classification of cosmic environments at 
non-linear scales in physical realizations of the large-scale 
structure conducted with real data. Using the Hahn et al. 
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(2007) definition (see also its extensions, Forero-Romero 
et al ., 2009; Hoffman et al ., 2012), we obtained three- 
dimensional, time-dependent maps of the posterior prob¬ 
ability for each voxel to belong to a void, sheet, filament 
or cluster. 

These posterior probabilities represent all the available 
structure type information in the observational data as¬ 
suming the framework of ACDM cosmology. Since the 
large-scale structure cannot be uniquely determined from 
observations, uncertainty remains about how to assign 
each voxel to a particular structure type. The question 
we address in this letter is how to proceed from the pos¬ 
terior probabilities to a particular choice of assigning a 
structure type to each voxel. Decision theory (see, for 
example, Berger, 1985) offers a way forward, since it ad¬ 
dresses the general problem of how to choose between 
different actions under uncertainty. A key ingredient be¬ 
yond the posterior is the utility function that assigns a 
quantitative profit to different actions for all possible out¬ 
comes of the uncertain quantity. The optimal decision is 
that which maximizes the expected utility. 

After setting up the problem using our example and 
briefly recalling the relevant notions of Bayesian decision 
theory, we will discuss different utility functions and ex¬ 
plore the results based on a particular choice. 


II. METHOD 

The decision problem for structure-type classification 
can be stated as follows. We have four different web-types 
that constitute the “space of input features:” {To = void, 
Ti = sheet, T 2 = filament, T 3 = cluster}. We want to 
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either choose one of them, or remain undecided if the data 
constraints are not sufficient. Therefore our “space of 
actions” consists of five different elements: {ao = “decide 
void,” cli = “decide sheet,” <22 = “decide filament,” <23 = 
“decide cluster,” and a_ 1 = “do not decide.”} The goal 
is to write down a decision rule prescribing which action 
to take based on the posterior information. 

Bayesian decision theory states that the action aj that 
should be taken is that which maximizes the expected 
utility function (conditional on the data d), given in this 
example by 
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U( aj (x k )\d) = Y J G{a j \ r T i )V(T i {x k )\d), ( 1 ) 

i =0 

where x & labels one voxel of the considered domain, 
V(Ti(xk)\d) are the posterior probabilities of the differ¬ 
ent structure types given the data, and G(aj |T$) are the 
gain functions that state the profitability of each action, 
given the “true” underlying structure. Formally, G is a 
mapping from the space of input features to the space of 
actions. For our particular problem, it can be thought 
of as a 5 x 4 matrix G such that G^- = G(aj |T^), in 
which case eq. ( 1 ) can be rewritten as a linear algebra 
equation, U = G.P where the 5-vector U and the 4- 
vector P contain the elements U j = U(aj(xk)\d ) and 
P^ = V(Ti{xk)\d ), respectively. 

Let us consider the choice of gain functions. Several 
choices are possible. For example, the 0/1-gain functions 
reward a correct decision with 1 for each voxel, while 
an incorrect decision yields 0. This leads to choosing 
the structure type with the highest posterior probabil¬ 
ity. While this seems like a reasonable choice, we need to 
consider that a decision is taken in each voxel, whereas 
we are interested in identifying structures as objects that 
are made of many voxels. For instance, since clusters are 
far smaller than voids, the a priori probability for a voxel 
to belong to a cluster is much smaller than for the same 
voxel to belong to a void. To treat different structures 
on an equal footing, it makes sense to reward the correct 
choice of structure type T^ by an amount inversely pro¬ 
portional to the average volume Vi of one such structure. 
In the following, we use the prior probability as a proxy 
for the volume fractions, 


V(T i) 


Vi 

Vo + Vt + Vz + Vs' 


( 2 ) 


We further introduce an overall cost for choosing a struc¬ 
ture with respect to remaining undecided, leading to the 
following specification of the utility, 


G(aj IT*) 


' 1 

—r— -a if j e [ 0 ,3] and i = j, 

-a if j e [0, 3] and i ^ j, ' ) 

l 0 if j = - 1 . 


This choice limits 20 free functions to only one free pa¬ 
rameter, a. With this set of gain functions, making (or 


not) a decision between structure types can be thought of 
as choosing to play or not to play a gambling game cost¬ 
ing a. Not playing the game, i.e. remaining undecided 
(j = —1), is always free (G(a-i\Ti) = 0 for all i). If the 
gambler decides to play the game, i.e. to make a decision 
(j G [0, 3]), they pay a but may win a reward, ^ , by 
betting on the correct underlying structure (i = j). 

In the absence of data, the posterior probabilities in 
equation ( 1 ) are the prior probabilities V(T *), which are 
independent of the position Xk , and the utility functions 
are, for j e [0,3], 


U(aj) 


and U(a-i) 
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52 G( a j |T») P(Tj) 

i =0 



n^j) 


5 > p ( T *) 


i=0 


1 — a 


3 \ 

P(Ti) + 5>(Ti) 

L Z = 0 j 

\ / 


1 - a, (4) 

0. (5) 


Equations (4) and (5) mean that, in the absence of data, 
this reduces to the roulette game utility function, where, 
if correctly guessed, a priori unlikely outcomes receive a 
higher reward, inversely proportional to the fraction of 
the probability space they occupy. Betting on outcomes 
according to the prior probability while paying a = 1 
leads to a fair game with zero expected net gain. The 
gambler will always choose to play if the cost per game 
is a < 1 and will never play if a > 1 . 

The posterior probabilities update the prior informa¬ 
tion in light of the data, providing an advantage to the 
gambler through privileged information about the out¬ 
come. In the presence of informative data, betting on 
outcomes based on the posterior probabilities will there¬ 
fore ensure a positive expected net gain and the gambler 
will choose to play even if a > 1. Increasing the param¬ 
eter a therefore represents a growing aversion for risk 
and limits the probability of losing. Indeed, for high <a, 
the gambler will only play in cases where the posterior 
probabilities give sufficient confidence that the game will 
be won, i.e. that the decision will be correct. 


III. MAPS OF STRUCTURE TYPES IN THE SDSS 

We applied the above decision rule to the web-type pos¬ 
terior probabilities presented in Leclercq, Jasche & Wan- 
delt (2015), for different values of a > 1 as defined by 
equation (3). In doing so, we produced various maps of 
the volume of interest, consisting of the northern Galactic 
cap of the SDSS main galaxy sample and its surround- 
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FIG. 1. Slices through maps of structure types in the late-time large-scale structure, at a = 1. The colour coding is blue 
for voids, green for sheets, yellow for filaments, and red for clusters. Black corresponds to regions where data constraints are 
insufficient to make a decision. The parameter a, defined by equation (3), quantifies to the risk aversion in the map: a = 1.0 
corresponds to the most speculative map of the large-scale structure, and maps with a > 1 are increasingly conservative. These 
maps are based on the posterior probabilities inferred by Leclercq, Jasche & Wandelt (2015) and on the Bayesian decision rule 
subject of the present work. 


ings. Slices through these three-dimensional maps 1 are 


1 For all slices shown, we kept the coordinate system of Jasche, 
Leclercq &; Wandelt (2015). 


shown in figure 1 for the late-time large-scale structure 
(at a = 1) and in figure 2 for the primordial large-scale 
structure (at a = 10 -3 ). 

When the game is fair (namely when a = 1), it is 
always played, i.e. a decision between one of the four 
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FIG. 2. Same as figure 1 for the primordial large-scale structure, at a = 10 3 . 


structure types is always made. This results in the spec¬ 
ulative map of structure types (top left panel of figures 1 
and 2). There, a decision is made even in regions that are 
not constrained by the data (at high redshift or outside 
of the survey boundaries), based on prior betting odds. 

By increasing the value of a > 1, we demand higher 
confidence in making the correct decision. This yields 
increasingly conservative maps of the Sloan volume (see 
figures 1 and 2). In particular, at high values of <a, the 


algorithm makes decisions in the regions where data con¬ 
straints are strong (see figures 3 and 6 in Leclercq, Jasche 
& Wandelt, 2015), but often stays undecided in the un¬ 
observed regions. It can be observed that even at very 
high values, a > 3, a decision for one structure is made 
in some unconstrained voxels (typically in favour of the 
structure for which the reward is the highest: clusters in 
the final conditions, and clusters or voids in the initial 
conditions). This effect is caused by the limited number 
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of samples used in our analysis. Indeed, because of the 
finite length of the Markov Chain, the sampled repre¬ 
sentation of the posterior has not yet fully converged to 
the true posterior. For this reason, the numerical rep¬ 
resentation of the posterior can be artificially displaced 
too much from the prior, which results in an incorrect 
web-type decision. This effect could be mitigated by ob¬ 
taining more samples in the original BORG analysis (for 
an increased computational cost); or can be avoided by 
further increasing ag at the expense of also degrading the 
map in the observed regions. We found the value of a = 4 
(bottom right panel of figures 1 and 2) to be the best 
compromise between reducing the number of unobserved 
voxels in which a decision is made to a tiny fraction and 
keeping information in the volume covered by the data. 

As expected, structures for which the prior probabil¬ 
ities are the highest disappear first from the map when 
one increases a: betting on these structures being poorly 
rewarded, this choice is avoided in case of high risk aver¬ 
sion. In the final conditions (figure 1), we found that 
sheets completely disappear for a « 1.68 and filaments 
for a ~ 4.01. In the initial conditions (figure 2), the 
critical value is around a ~ 2.36 for both sheets and fila¬ 
ments. In the most conservative maps displayed in figures 
1 and 2 (a = 4.0), the SDSS data provide extremely high 
evidence for the voids and clusters shown. In constrained 
parts, extended regions belonging to a given structure 
type may not have the expected shape. This is true in 
particular for filamentary regions. Several factors can 
explain this: first, slicing through filaments make them 
appear as dots; second, with the dynamic Hahn et al. 
(2007) definition, filament regions often extend out into 
sheets and voids, and their static skeleton geometry is 
not the most prominent at the voxel scale (3 Mpc jh in 
this work). 

As detailed in Jasche, Leclercq & Wandelt (2015), data 
constraints are propagated by the structure formation 
model assumed in the inference process (second-order La- 
grangian perturbation theory) and therefore radiate out 
of the SDSS boundaries. For this reason, for moderate 
values of <a, web-type classification can be extended be¬ 
yond the survey boundaries to regions influenced by data. 
This can be observed in figures 1 and 2, where one can 
see, for instance, that the shape of voids that intersect the 
mask is correctly recovered. Similarly, the classification 
of high-redshift structures confirms that the treatment 
of selection effects by BORG is correctly propagated to 
web-type analysis. 

We finally comment on the required computational re¬ 
sources for the complete chain for running BORG, com¬ 
puting the web-type posterior, and making a decision. 
Inference with BORG is the most expensive part: on av¬ 
erage, one sample is generated in 1500 seconds on 16 
cores (Jasche, Leclercq & Wandelt, 2015). Then, in each 
sample, tidal shear analysis (Leclercq, Jasche & Wandelt, 
2015) is a matter of a few seconds. Once the web-type 
posterior is known, making a decision, which is the sub¬ 
ject of the present letter, is almost instantaneous. There¬ 


fore, once the density field has been inferred, which is use¬ 
ful for a much larger variety of applications, our method 
is substantially cheaper than several state-of-the-art tech¬ 
niques for cosmic web analysis (e.g. the method of Tern- 
pel, Stoica & Saar, 2013; Tempel et al ., 2014, for detect¬ 
ing filaments). 

IV. CONCLUSIONS 

In this letter, we proposed a rule for optimal decision 
making in the context of cosmic web classification. We 
described the problem set-up in Bayesian decision the¬ 
ory and proposed a set of gain functions that permit an 
interpretation of the problem in the context of game the¬ 
ory. This framework enables the dissection of the cosmic 
web into different elements (voids, sheets, filaments, and 
clusters) given their prior and posterior probabilities and 
naturally accounts for the strength of data constraints. 

As an illustration, we produced three-dimensional tem¬ 
plates of structure types with various risk aversion, de¬ 
scribing a volume covered by the SDSS main galaxy sam¬ 
ple and its surrounding. These maps constitute an ef¬ 
ficient statistical summary of the inference results pre¬ 
sented in Leclercq, Jasche & Wandelt (2015) for cross-use 
with other astrophysical and cosmological data sets. 

Beyond this specific application, our approach is more 
generally relevant to the solution of classification prob¬ 
lems in the face of uncertainty. For example, the con¬ 
struction of catalogues from astronomical surveys is di¬ 
rectly analogous to the problem we describe here: it si¬ 
multaneously involves a decision about whether or not 
to include a candidate object and which class label (e.g. 
star or galaxy) to assign to it. 
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